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a widely believed girth conjecture by Erdos. 
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1 Introduction 



Computing shortest path distances in graphs is a fundamental algorithmic problem and has 
received a lot of attention for several decades. Classical algorithms include that of Dijkstra 
which handles graphs with non-negative edge weights, and the algorithm of Bellman-Ford 
which is slower but applies also when negative edge weights are present. A considerable 
drawback of these algorithms is that they are too slow for many applications. For instance, a 
GPS system needs to report shortest path distances extremely fast in very large road networks. 
If Dijkstra's algorithm is used, in the worst case the entire network would need to be explored 
just to compute a single distance. Another problem is that the whole graph would have to be 
stored in memory. If the graph is dense, it might not fit in the main memory and this would 
slow down computations considerably. 

One way to speed up computations is to precompute distances between all pairs of vertices 
in a preprocessing step and store them all in a look-up table. Distance queries can then 
be answered in constant time. The fastest known all-pairs shortest paths algorithm has 
only marginally subcubic running time [5]. For sparser graphs, repeated applications of 
Dijkstra's algorithm yield 0{mn + n'^ log 7i) time. Pettie [12] gave a slightly improved bound 
of 0{mn + log log n). Even for sparse graphs, these algorithms are too slow for many 
applications. Another disadvantage of this scheme is the huge amount of memory required to 
store the look-up table for all the distances. 

1.1 Approximate distances 

A way to deal with these issues is to settle for approximate shortest path distances. For a 
directed graph G, a distance from a vertex u to a vertex v along some path in G is of stretch 
5 > 1 if the path is at most 5 times longer than a shortest path from u to v m. G. 

Zwick [T7] showed how to compute all-pairs stretch (1 + e)-distances in directed graphs in 
Q^^2.376^ time for an arbitrarily small constant e > 0. In the seminal paper of Thorup and 
Zwick [15], it was shown how to preprocess an undirected graph in close to linear time to 
build a data structure of near-linear size which reports small-stretch distances very fast. More 
precisely, for an undirected graph G with non-negative edge weights, m edges, and n vertices 
and for any integer A; > 1, a data structure of size 0{kn^'^^/^) can be built in 0{kmn^/^) 
time which gives distances of stretch at most 2A; — 1 in 0{k) time. Since this data structure 
has constant query time for k = 0(1), it is referred to as an approximate distance oracle. We 
emphasize that the result only holds for undirected graphs; as shown by Thorup and Zwick, 
no compact distance oracles exist in general for directed graphs. 

The Thorup-Zwick oracle is randomized. Roditty, Thorup, and Zwick [13] showed how to 
obtain a deterministic algorithm with only a polylogarithmic increase in preprocessing time. 
Mendel and Naor |10j showed how to improve query time of the Thorup-Zwick oracle to 0(1) 
and space to 0{n^^^/^) at the cost of a larger stretch of 0{k) and a longer preprocessing time 
of 0(n2+Vfc \ogn). 

For small A;, the size/stretch tradeoff of the Thorup-Zwick oracle is essentially optimal 
due to a (widely believed and partially proved) girth conjecture of Erdos from 1963 [9]. The 
only possible improvement is thus in the time for preprocessing. It was shown by Baswana 
and Kavitha [2] that an oracle with the same size and stretch can be computed in O(n^) 
time which is an improvement when the number of edges m is uj{v?'~^^^ /k) and it is the first 
essentially optimal oracle for m = 0(n^). 
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1.2 Contributions of this paper 

The main contribution of this paper is to break the quadratic time bound of Baswana and 
Kavitha [2] for A; > 6 and m = o{n'^) and to break the Thorup-Zwick bound |15j when the 
graph is not too sparse and k not too small. We show that there exists a constant c such that 
for any integer k > 1, a {2k — l)-approximate distance oracle of size 0{kn^'^^^^) and with 
query time 0{k) can be constructed in 0{\/km-\-k'n}~^'^/^) time. When m = Q,{'n}'^'^/^) and 
k = 0(1), preprocessing time is linear and our construction is thus optimal in every respect 
(stretch, size, preprocessing time, and query time), assuming the girth conjecture. The oracle 
of Baswana and Kavitha only has linear preprocessing for m = Q{v?) and the Oikm-n}/^) 
preprocessing of Thorup and Zwick is super-linear for any constant k. 

We also present an oracle which is faster for smaller k. When k > 3, k mod 3 = 0, its 
preprocessing time is 0{km + A;n^/^+^/'^). We get similar bounds for other values of /c > 3: 
when k mod 3 = 1, preprocessing is Oikm + rc'/'^^^l'y'^^^~^^/^'^'^^~^))) and when k mod 3 = 2, it 
is 0{km + 71'^/'^+'^/ (''~^)~^/ . In particular, we achieve subquadratic preprocessing for 
all k>6. 

Our algorithms are very simple to describe and analyze, given previous black boxes. As 
in earlier approaches, we make use of random vertex sampling. We apply a result of Baswana 
and Kavitha to sparsify our graph w.r.t. this sampling while preserving exact distances 
between pairs of vertices that are close in some sense. We construct a Thorup-Zwick oracle 
for this sparser subgraph, allowing us to report {2k — l)-approximate distances between such 
vertex pairs. For pairs that are farther apart, we make use of spanners. For a 5 > 1, a 
6-spanner of a graph G is a subgraph that spans all vertices and preserves distances between 
all vertex pairs up to a factor of 5. To construct our oracle with small preprocessing time for 
small k, we run the linear-time algorithm of Baswana and Sen [4] to get a spanner of small 
stretch. We apply Dijkstra to get exact distances in this spanner between all pairs of sampled 
vertices. These distances together with distances from vertices to their nearest sampled vertex 
in the original graph allows us to report stretch {2k — l)-distances also for vertices far apart. 

Our oracle with near-linear preprocessing time for larger k does not run Dijkstra but 
instead constructs, on top of a spanner, what we call a restricted oracle. This oracle only 
allows us to query distances between sampled vertices but is more compact than the Thorup- 
Zwick oracle. We pick the stretch of the spanner and the restricted oracle to be @{^/k) and 
we show that the oracle gives stretch {2k — l)-distances in the underlying graph. 

We have summarized previous results on distance oracles as well as our new results in 
Tabled! 

1.3 Related work 

A problem related to distance oracles is that of finding spanners. We have already mentioned 
the linear-time algorithm of Baswana and Sen [3] to find a spanner of stretch 2k — 1. There 
has also been interest in so-called {a, /3)-spanners, where a and f3 are real numbers. Such 
a spanner H of a graph G ensures that for all vertices u and v, dH{u,v) < adQ{u,v) + /?. 
In other words, H allows an additive stretch in addition to a multiplicative stretch. Thorup 
and Zwick |16] showed the existence of (1 -|- e, /3)-spanners of size 0(n^+^/'^) for any constant 
k, where /3 = {c/e)^ for some constant c. A (1, 2)-spanner of size was presented 

by Dor, Halperin, and Zwick (7|. The size was later improved slightly by Elkin and Peleg 
to 0(n^/2) [8]. Baswana, Kavitha, Mehlhorn, and Pettie [3] gave a spanner of size 0(n^/^) 
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Table 1: Performance of distance oracles in weighted undirected graphs. 
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which has additive stretch 6 and no multipHcative stretch. This is currently the smallest 
known spanner with constant additive stretch and no multiplicative stretch. 

Allowing additive stretch in oracles has also been considered. For unweighted graphs, 
Baswana, Gaur, Sen, and Upadhyay [1] showed how to get subquadratic construction time by 
allowing constant additive stretch in addition to a multiplicative stretch of k. Patra§cu and 
Roditty [11] showed that for unweighted graphs there exists an oracle of size 0(n^/^) which has 
multiplicative stretch 2 and additive stretch 1. Furthermore, they showed that for weighted 
graphs with m = r? ja edges, there is an oracle of size 0(n^/a^/^) with multiplicative stretch 
2 and no additive stretch. Preprocessing time was not considered in [llj . 

The organization of the paper is as follows. In Section [2l we give some basic definitions 
and notation as well as some tools that will prove useful. In Section [3l we give the stretch 
(2k — l)-oracle which is fast for small k. Our near-linear time oracle is then presented in 
Sectional Finally, we make some concluding remarks in Section [5l 

2 Definitions, Notation, and Toolbox 

Let G = (y, E') be an undirected graph with non-negative edge weights. For our problem, we 
may assume that all edges have strictly positive weight since zero-weight edges can always be 
contracted. Also, we shall only consider connected graphs. For vertices u, f E y, we denote 
by dciujv) the shortest path distance in G between u and v. 

For a real value d > 1, a 6 -spanner of G is a subgraph H = (V, Eh) of G spanning all its 
vertices such that for any distinct vertices u and v, dH{u,v) < Sdciu^v). 

Let S* be a non-empty subset of V . For a vertex n, let ps{u) be the vertex of S nearest 
to u w.r.t. do (ties are resolved arbitrarily). We denote by Bs{u) the set of vertices v with 
dciujv) < dciu^psiu)). Let Es{v) denote the set of edges incident to v with weight less 
than dciy^psiy)). We define Eg = yJv&vEs{v) and Gs = {V^ Es). We need the following two 
simple results. 

Lemma 1. Given an undirected graph G = (y,E) with m edges and n vertices and given a 
non-empty subset S of V , ps{u) and dG{u,ps{u)) can he computed in 0[m + nlogn) time 
over all vertices u £ V . 

Proof. Connect a new vertex s with a zero-weight edge to each vertex of S. Run Dijkstra 
(implemented with Fibonacci heaps) from s in this augmented graph. For each vertex u £ V, 
ps{u) is the unique ancestor of u belonging to 5 in the shortest path tree found and the 
distance from s to ps{u) in the tree equals dG{u,ps{u)). □ 

Lemma 2. Given an undirected graph G = (V, E) with m edges and n vertices and given a 
non-empty subset S ofV, Gs can be computed in 0(m + n log n) time. 

Proof. Apply Lemma[I]to identify ps{u) and dG{u,ps{u)) for each u £ V. Then Es{u) can 
be found in time proportional to the degree of u. Hence, Gs can be found in 0{m) time in 
addition to the 0{m + nlogn) time from Lemma [TJ □ 

The following result is due to Baswana and Kavitha (see Lemmas 2.2 and 2.3 in [2]). 

Lemma 3. Let G = {V, E) be an undirected n-vertex graph with positive edge weights and let 
5" C 5" / 0. For any two vertices u,v £ V , if v £ Bs{u) then dGg{u,v) = dG{u,v). If S 
is obtained by picking each vertex independently with probability p, then Es has expected size 
0{n/p). 
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3 A {2k — l)-Approximate Distance Oracle 



In this section, we present a {2k — l)-approximate distance oracle with subquadratic prepro- 
cessing time for /c > 6. As a warm-up, we first present an 0{k) stretch oracle with near-linear 
preprocessing. It is a trivial combination of the linear time spanner of Baswana and Sen [3] 
and the Thorup-Zwick oracle |15j . This idea is probably quite common knowledge and was 
noted by Sen [H]. However, the bounds obtained do not appear to be stated explicitly in the 
literature so we include them here. Later in this section and in Section [H we shall refine this 
idea in order to get optimal tradeoff between size and stretch. 

Theorem 1. Let G he an undirected graph with m edges and n vertices and let e > be a con- 
stant. For any integer k > 1, an 0{k) -approximate distance oracle for G of size 
can be constructed in 0{km-\-kn^~^^^^~^'') time and can answer distance queries in 0{k) time. 

Proof. We compute in 0{km) time a spanner H oi G with stretch at most [1/e] = and 
with niH = 0{n^'^'^^^^/'''^) = 0{n^~^'') edges using the linear time algorithm of Baswana and 
Sen [4]. We then construct a Thorup-Zwick oracle of stretch 2A; — 1 on top of H. It has size 
0{kn^~^^/^), query time 0{k), and preprocessing time 0{kmHn^^^) = 0{kn^^^/^^''). Since it 
has stretch 0{k) and H has stretch 0(1), the theorem follows. □ 

3.1 Preprocessing 

We now present our {2k — l)-approximate distance oracle and start with the preprocessing 
step. Each vertex is sampled with probability p = n~'^/^, for some < i < /c to be specified; 
we allow i to be a non-integer. Let S be the set of sampled vertices. We construct Gs 
in 0{m + nlogn) time using Lemma [2l The expected size of 5" is pn = n^~'^/^ and by 
Lemma [31 the expected size of Es is 0{n/p) = 0{n^+^/^). We can rerun the sampling until 
\S\ = Q{n^~'^/^) and \Es\ = 0{v}^^/^); by Markov's inequality, only a constant expected 
number of reruns is needed for this. 

We compute and store both ps{u) and dG{u,ps{u)) for all u £V. By Lemmadl this can 
be done in 0{m + nlogn) time. 

Next, we build the distance oracle of Thorup and Zwick for the graph Gs = {V, Es). This 
takes 0{k\Es\n^^'') = 0{kn^~^^'^'^^^^'') expected time and requires 0{kn^~^^^'^) space. For some 
integer k' (to be specified), we apply the 0{k'm) time algorithm of Baswana and Sen [1] to 
find a {2k' — l)-spanner H = {V^Eh) of G with \Eh\ = 0{k'v}^^/^ ) edges. For each pair 
of sampled vertices p,q £ S, we compute and store dH{p,q). The latter is done by running 
Dijkstra in H from each sampled vertex. Implementing Dijkstra with Fibonacci heaps, this 
takes a total of 0{\S\{\Eh\ + nlogn)) = ©(/c'n^+Vfc'-iA') time. The space required to store 
all the 5 X 5 distances is 0{\S\'^) = 0{r?-'^^l^). 

It follows from the above that total expected preprocessing time is 0{k'm + kn^'^^'^'^^'^^^ + 
y^2+i/k -j/fe-j ^YiQ amount of space needed for our oracle is 0(A;n^~^^/^' + n^~^*/^). 

3.2 Answering a distance query 

Now, let us consider how to answer a distance query for vertices u^v & V , given the above 
preprocessing. In constant time, we look up vertices ps{u) and ps{v) as well as distances 
dG{u,ps{u)) and dG{v,ps{v)). We first query the distance oracle associated with Gs and 
get a distance estimate di{u,v) in 0{k) time. We then obtain the precomputed value 
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dH{psiu),Psiv)) in constant time and obtain another distance estimate d2{u, v) = dG{u,ps{u)) + 
dHiPsiU'),ps{v)) + dG{v,ps{v)). The smahest of di{u,v) and d2{u,v) is then output as the 
answer to the query. 

3.3 Bounding stretch 

Let dG{u,v) = mm{di{u,v),d2{u,v)} denote the distance estimate that our query step out- 
puts. We show in the following that for a suitable choice of k' , dciu^v) < dG{u,v) < 
{2k — l)dG{u,v). The first inequality is clear since both di{u,v) and d2{u,v) are the weights 
of some u-v-paths in G; in particular, they are both at least as long as a shortest u-v-path. 
In the following, we show dG{u,v) < {2k — l)dG{u,v). 

If n G Bs{v) oi V £ Bs{u) then by Lemma [3l dGs{u, v) = dG{u, v). In this case, the oracle 
for Gs outputs di{u,v) < {2k — l)dG{u,v) so our query algorithm will output dG{u,v) < 
di{u,v) < {2k — l)dG{u,v), as desired. 

Now assume that u ^ Bs{v) and v ^ Bs{u). Then dG{u,ps{u)),dG{v,ps{v)) < dG{u,v) 
and hence 

dH{ps{u),ps{v)) < {2k' - l)dG{ps{u),Ps{v)) 

< {2k' - l){dG{ps{u),u) + dG{u,v) + dG{v,ps{v))) 

< {6k' - 3)dG{u,v), 

sod2{u,v) = dG{u,ps{u)) + dH{ps{u),Ps{v)) + dG{v,ps{v)) < {6k' - l)dG{u,v). Hence, if we 
choose k' = [k/3\, our query algorithm gives the desired stretch, i.e., dG{u,v) < d2{u,v) < 
{2k — l)dG{u, v). Here we need to assume that k > 3 since k' needs to be at least 1. 

3.4 Running time 

Now, let us bound running time. Our query step runs in 0{k) time as mentioned above. With 
our choice of k' , preprocessing takes 0{km + fcn^"'"^*"'"^)/'^ _|_ j^^+^/L'^/^J"*/'^) time. We consider 
three cases, k mod 3 = 0, k mod 3 = 1, and k mod 3 = 2, and we will fix a value of i in each 
of them that minimizes preprocessing time. In the first case, k' = k/3 and we set i = k/2 + 1, 
in the second case k' = {k — l)/3 and we set i = {k — l)/2 + 3k/{2{k — 1)), and in the third 
case k' = {k — 2)/3 and we set i = {k — 2) /2 + {2k — l)/{k — 2). The time and space bounds 
we obtain from these choices are stated in the following theorem. 

Theorem 2. Let G he an undirected graph with m edges and n vertices and let k > 3 be 
an integer. If k mod 3 = 0, a {2k — 1)- approximate distance oracle for G of size 0{kn^~'^^/'') 
can be constructed in 0{km + kn^/'^'^'^/^) time and can answer queries in 0{k) time. If 
k mod 3=1 resp. k mod 3 = 2, the same size and query bounds hold and construction time 

is 0{km + A:?l3/2+l/(2fc)+3/(2(fc-l))) ^^^^^ qQ^^ ^ ^.,^3/2+2/(fc-2)-l/(fc(fc-2)))_ 

We see that Theorem [2] breaks the quadratic time bound of Baswana and Kavitha [2] 
when A; > 6. 

4 Near-Linear Time Oracle 

In this section, we give our {2k — l)-approximate distance oracle that breaks the preprocessing 
time of the previous section for larger k. It achieves a time bound arbitrarily close to linear 
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when k is sufficiently large. We shall modify the oracle of Section [3] and the modification 
we make is in the distance computations in the spanner H between sampled vertices in S. 
Instead of Dijkstra, we build an approximate distance oracle on top of H and then use it to 
report distances. However, we cannot apply this idea directly since we aim for 0{kn^~^^/^) 
space; if we built the Thorup-Zwick oracle of that size on top of H, it would give stretch 
2k — 1 which for our oracle would be multiplied with the stretch of H. Instead, we will use 
the fact that the oracle only needs to report distances between vertices of S, allowing us to 
improve the Thorup-Zwick space/stretch tradeoff. 

Overview of the Thorup-Zwick oracle: First, let us very briefly go through the algo- 
rithm of Thorup and Zwick. For an integer k > 1, to build a size 0(Kn^"'"^/'*) oracle with 
stretch 2k — 1, sets Aq, . . . , are formed, with V = Aq ^ Ai ^ A2 ■ ■ ■ ^ A^ = 0. For 
i = — 1, set Ai is formed by picking each element of independently with prob- 

ability rT^I^ . For i = 0, . . . , K — 1, distance dci^Ai^ v) = min^„g^^ dciw, v) is computed for 
each vertex v and a vertex pi{v) of Ai achieving this distance is kept. Then so called bunches 
B{v) are formed around each vertex v and are defined by: 

B{v) = yj^lQ{w £ Ai\Ai+i\dG{w,v) <dG{Ai+i,v)}. 

The oracle answers a query for the approximate distance between vertices u and v by 
repeatedly checking whether suitably chosen vertices belong to bunch B{u) or bunch B[v). 
More precisely, it starts by setting w := u and initializes a counter i := 0. Then as long as 
w ^ B{v), it increments i, swaps u and v and updates w := Pi{u). When w E B{v) (which 
will happen at some point) dciw^u) + dciw^v) is returned as approximate distance. 

Improving space: The space requirement of the Thorup-Zwick oracle is dominated by the 
size of the bunches B{v). For our application, we consider a restricted version of this oracle 
where we only care about queries between pairs of vertices from set S. It follows from the 
query step of the Thorup-Zwick oracle that we only need to store bunches B{v) for v G S" to 
answer such queries. As shown by Thorup and Zwick, each bunch has size O^Kri^/'^). Hence, 
to build our restricted oracle, we only need 0{k\S\v}/'^) space0 

Our oracle: We get our near-linear time oracle by combining ideas from Section [3] with the 
restricted oracle above. As before, set S is formed by sampling each vertex with probability 
p = n~^l^ and we construct the Thorup-Zwick oracle for graph Gs = {V,Es). We build a 
{2k' — l)-spanner H = {V,Eh) with 0{k'v}^^^'^') edges. Now, instead of applying Dijkstra 
to find the exact distance dnip^q) in H between each pair of samples p,q £ S, we build a 
restricted {2k, — l)-stretch oracle for H w.r.t. set S. It is built with the same procedure as 
that of Thorup and Zwick but only bunches B{v) for v £ S are kept. Total preprocessing for 
the oracles for Gs and H is then 0{k'm + A;n^+(*+^)/'^ -|- Kk'n^^^^^''^^^'^). We shall specify i, 
k' , and k below. 

To answer a distance query for vertices u and v, we do as in Section[3]but instead of using 
the exact spanner distance dH{ps{u),Ps{v)) we use that obtained by the restricted oracle. 

^In fact, using the source-restricted distance oracle in [13], we can reduce space further to 0{k\S\^'^^'''^) 
by keeping 0(151) bunches each of size 0(k|5'|^/''). This also gives a slightly improved preprocessing time. 
Unfortunately, |S| is too big for this result to give any significant improvement of our oracle. 
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The analysis for bounding the stretch for such a query is almost identical to that in 
Section [331 We only need to consider the case u ^ Bs{v) and v ^ Bs{u) and we get a stretch 
bounded by 2 + 3{2k' — 1){2k — 1) since we apply a {2k — l)-approximate distance oracle 
instead of Dijkstra. 

The size of our new oracle is bounded by the size of the oracle for Gs and the restricted 
oracle for H (note that we do not need to store H after the restricted oracle has been 
built). The first oracle requires 0{kn^~^^^'') space and as we saw above, the restricted oracle 
requires 0{\S\Kn^^'^) = 0(^72^"'*/'^+^/'') space. Since we only allow space 0{kn^~^^^^), we 
need k > k/{i + 1). To minimize stretch, we pick k as small as possible while satisfying this 
inequality, i.e., we pick k = \k/{i + 1)]. Substituting this for k and requiring that stretch is 
at most 2 A; — 1, we get the inequality 



2 + 3{2k' - 1) ( 2 



k 



i + 1 



1 <2k-l^k' < 



k + 3(^ 


k 
i+1 




6 


k 
i+1 


- 3 



To minimize running time, we pick k' as large as possible while satisfying this inequality, i.e., 



we set k' = [ 



k+3{l 



i+1 



-1) 



(for this to make sense, we need k' > 1 but we shall choose i and 



k so that this is ensured). 

We are aiming for a preprocessing time of 0{k'm + k'n}'^'^/^) for some constant c > 0. 
We shall pick c such that an i (possibly non-integer) can be found satisfying 



2Vk 



< 



k 



i + 1 



< 



k 



i + l 



< 



z\fk 



(1) 



To ensure this, it suffices to require that 2^/k/c + 1 < c-v/fc/18. Multiplying by c on both 
sides, we get 

Vk 



-(? - c - 2^fk >0<^c>4= + 9 
18 ~ ~ ^fk 



1 4 



Hence, since > 1, picking c = 9 + 3VT3 will allow us to pick an i satisfying inequalities ([T|). 
We claim that this choice of % gives the desired preprocessing time bound. Assume that 
since we are only interested in subquadratic bounds. Then 



A;' 



A: + 3( 


k 

i+1 




6 


k 
i+1 




- 3 



A; + 3 



> 



i+1 



c\fk Q 



1 > 



3 



3\/A; - c ^ 2^fk 



Since also k = \k/{i + 1)] '>k/{i + l)> 2^/k/c and nk' = 0{k), total preprocessing time is 
0(A;'m+A:n^+(^+^)/'=+KA:'n^+^/'^'+i/'^) = ©(Vfem+fen^+^/^^v^^+fen^+^Z^) = 0{^m+kn^+'^l'^), 
as desired. We can now state the main result of the paper. 

Theorem 3. Let G he an undirected graph with m edges and n vertices and let k > 1 
be an integer. Then a {2k — 1)- approximate distance oracle for G of size can 
be constructed in 0{^fkm + kn^~^^^^) time, for some constant c, and can answer distance 
queries in 0{k) time. 
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The bound we gave on constant c above was not very tight. In the following, let us bound 
this constant for large k. Instead of picking i such that inequalities ([1]) are satisfied, pick it 
such that K = \k/{i + 1)] > k/{i + 1) = y/kj^- Then 



k' 



k + 3(^ 


k 
i+1 


-0 


6 


k 

i+1 




- 3 



> 



k + - 3 



k + 3x^ 



A;-3a/| -6 



> 



6^/1 + 3 



6/C + 3 



which is at least 1 when k > 29. Hence 



- + T7 < \ T + 



6k + 3 



K k 



k 



A;-3a/I -6 



and it follows that we can pick c arbitrarily close to 2^/6 if k is bounded from below by a 
sufficiently large constant. 



5 Concluding Remarks 

For an undirected graph G with m edges, n vertices, and non- negative edge weights, and for 
an integer > 1, the main result of this paper is a {2k — l)-approximate distance oracle of G 
having size query time which can be constructed in 0{\/km + k'n}'^'^/^) 

time for some constant c. We also gave an oracle with faster preprocessing for smaller k. 
Together, these two results break the quadratic preprocessing time of Baswana and Kavitha 
for > 6 and improve the 0{kmn^/^) bound of Thorup and Zwick when G is not too sparse 
and k not too small. 

Assuming the girth conjecture, our oracle is optimal in every respect when m = Jl(n^"'"'^/^) 
and k = 0(1) since then preprocessing time is linear. Whether linear preprocessing time is 
achievable also for sparser graphs remains an open problem. Our oracles break the quadratic 
preprocessing bound when k > 6. What is possible for smaller k? Finally, we believe that 
by using existing techniques, it should be possible to derandomize our oracles at only a small 
increase in preprocessing time. 
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